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A  Sample-Size  Optimal  Bayesian  Procedure 
for  Sequential  Pharmaceutical  Trials 

by 


Noel  Cressie  and  Jonathan  Biele 


SUMMARY 

Consider  a  pharmaceutical  trial  where  the  consequences  of  different  decisions  axe  ex¬ 
pressed  on  a  financial  scale.  The  efficacy  of  the  new  drug  under  consideration  has  a  prior 
distribution  obtained  from  the  underlying  biologiczJ  process,  animal  experiments,  clinical 
experience,  and  so  forth.  In  an  important  paper.  Berry  and  Ho  (1988)  show  how  these  com¬ 
ponents  are  used  to  establish  an  optimal  (Bayes)  sequential  procedure,  assuming  a  known 
constant  sample  size  at  eewii  decision  point.  We  show  in  this  article  how  it  is  also  possible 
to  optimize  with  respect  to  the  sample-size  rule.  This  last  component  of  the  design,  which 
is  missing  from  most  sequential  procediures,  has  the  potential  to  yield  considerably  larger 
expected  net  gmns. 


Keywords:  Backward  induction;  clinicad  trials;  group-sequential  procedure;  optimal  stop¬ 


ping;  predictive  distribution;  VPRT. 
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1.  Introduction. 


Berry  and  Ho  (1988)  argue  cogently  for  a  Bayesian  decision-theoretic  approach  to 
the  design  and  analysis  of  pharmaceutical  trials.  They  (and  we)  address  the  questica  of 
when  a  drug  development  program  should  be  stopped  in  the  face  of  evidence  that  suggests 
insufficient  drug  efficacy.  The  purpose  of  this  tirticle  is  not  to  enter  into  a  debate  about 
Bayesitin  versus  frequentist  methods  for  the  design  and  analysis  of  sequential  clinical  trials, 
but  rather  to  implement  fully  the  Bayesian  technology.  Breslow  (1990),  and  the  discussion 
following  his  article,  contrasts  the  two  methods. 

In  the  pharmaceutical  setting,  it  is  often  reasonable  to  assume  that  the  consequences 
related  to  the  objectives  of  the  trial  can  be  converted  to  a  financial  scale.  In  decision- 
theoretic  terms  (e.g.,  RaifFa  and  Schlaifer,  1961:  DeGroot,  1970),  we  assume  that  the  drug 
company  is  able  to  construct  a  realistic  gain  (or  loss)  function.  Such  a  function  might 
come  from  a  cost-benefit  analysis.  What  we  are  proposing  in  this  paper  is  an  optimtil 
way  to  conduct  the  clinical-trials  part  of  the  drug  development  progrtun,  but  the  same 
ideas  could  be  used  in  later  parts  of  the  program  (where  one  must  choose  between  severzd 
possibilities,  one  of  which  is  to  put  off  the  decision  and  obtain  more  data  to  help  make  a 
decision  at  the  next  time  point).  In  both  circumstances,  the  Bayesian  approach  provides 
the  decision-raaJser  with  not  only  an  optimal  stopping  and  termineJ-decision  rule,  but  also 
with  an  optimal  sample-size  rule. 

Consider  sequential  decision  procedures  that  eillow  seimpling  in  other  than  a  one-at- 
a-time  manner.  Although  group-sequential  sampling  was  suggested  by  Wald  (1947,  pp. 
101-104),  it  was  not  until  much  later  that  interest  in  these  techhniques  (with  constant 
group  size)  was  initiated;  see,  for  example,  Ghosh  (1970,  p.  224),  Pocock  (1977),  and 
Gupta  and  Miescke  (1984).  In  all  these  studies,  at  each  decision  point  the  sample  size  is 
either  assumed  to  be  known  in  some  way  or  is  chosen  in  am  ad  hoc  (perhaps  stochaistic) 
manner.  Here  we  focus  attention  on  the  optimal  choice  of  sample  sizes  as  a  component 
of  a  sequential  decision  procedure  for  pharmaceutical  trials.  In  the  related  problem  of 
sequential  hypothesis  testing.  Whittle  (1965),  Ehrenfeld  (1972),  and  Spaiin  and  Ehrenfeld 
(1974)  considered  special  caises:  Cressie  amd  Morgan  (1988)  gave  a  general  formulation  for 
two  simple  hypotheses. 

Section  2  defines  the  pharmaceuticad  triad  under  consideration  in  this  article  and  es¬ 
tablishes  the  necessary  notation.  In  Section  3,  the  optimal  (Bayes)  sequentiad  procedure  of 
Berry  and  Ho  ( 1988)  is  re-established  amd  then  improved  by  further  optimizing  over  sample 
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sizes.  Two  illustrative  examples  are  given  in  Section  4:  in  one,  the  benefits  of  sample-size 
optimization  are  seen  to  be  substantial.  Section  5  contains  concluding  remarks. 

2.  Sequential  Pharmaceutical  Trial. 

We  now  give  a  brief  description  of  the  problem  considered  by  Berry  and  Ho  (1988) 
and  show  how  our  proposed  sample-size  optimization  mziximizes  the  expected  net  gain 
(ENG)  over  zdl  sequential  procedures.  The  purpose  of  the  sequential  triad  is  to  test  an 
experimental  treatment  compared  to  a  control;  at  each  time  the  same  number  of  data  is 
collected  from  the  treatment  population  as  from  the  control  population. 

The  data  from  the  control  population  is  assumed  to  be  independent  and  identically 
distributed  (i.i.d.)  Gau(/Xctij<^^/2)  and  that  from  the  treatment  population  is  assumed  to 
be  i.i.d.  Gau(/itrt,  distribution  with  mean  and  variance  (Here,  Gau(a,  i>^) 

denotes  a  Gaussian,  or  normal,  distribution  with  mean  a  and  variance  b^.)  Define 

6  =  fitrt  -  Hctl  ■  (2.1) 

We  wish  to  test  sequentially  the  composite  hypotheses 

Ho  :  6  <  So  versus  Hi  :  6  >  So  ,  (2.2) 

where  without  loss  of  generality  we  take  So  =  0.  Suppose  that  a  sample  of  size  n*  is 
requested  at  time  t  and  obtained  at  time  (<  +  1),  for  testing  (2.2).  Let  itn.t  and 
denote  the  corresponding  seunple  means.  Then 

~  Gau(6,<7'/n,);  t  =  0. 1,  •  •  • ,  T  -  1  ,  (2.3) 

where  T  denotes  the  truncation  time  by  which  a  decision  about  Ho  or  Hi  nuat  be  made. 
Clearly,  the  statistics  U7i,u;2,  •  •  •  are  sufficient  for  inference  on  S. 

Define 


which  is  the  sample  mean  of  all  differences  between  treatment  and  control  up  to  time 
(t  -f  1).  It  will  be  seen  that  (2.4)  is  the  basis  of  inference  on  S  made  at  time  (t  -f  1).  Group- 
sequential  testing  usually  assumes  no  =  nj  =  •  •  •  =  nr -i  =  k,  where  k  is  a  predetermined 
seunple  size.  Berry  and  Ho  ( 1988)  make  this  assumption,  but  in  this  article  we  demonstrate 
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that  considerable  gains  axe  possible  by  optimizing  with  respect  to  sample-size  choice  at 
each  time  point. 

We  take  a  Bayesian  approach  to  the  sequential  analysis  of  pharmaceutical  trials.  That 
is.  the  consequences  of  right  or  wrong  decisions  about  6  can  be  distilled  into  a  gain  function. 
sampling  has  a  cost  associated  with  it,  and  we  have  a  prior  opinion  about  6  that  is  here 
expressed  tis, 

6  ~  Gau(i/o,To )  .  (2.5) 

In  line  with  (2.4),  define 

uJo  =1/0  .  (2.6) 

A  Bayes  procedure  maximizes  the  expected  gain  net  of  sampling  costs  or  expected  net 
gain  (ENG).  It  is  usually  defined  as  a  decision  procedure  that  minimizes  risk  or  expected 
net  loss;  however,  an  ENG-maximizing  procedure  is  easily  seen  to  be  Bayes  by  defining 
the  loss  function  to  be  the  negative  of  the  specified  gain  fvmction. 

The  gain  fimction  g  for  a  sequential  pharmaceutical  tried  is  a  function  of  the  value  of 
6  and  which  of  Hq  (i.e.,  A(  =  0)  or  Hi  (i.e.,  At  =  1)  is  specified  at  time  t.  Potentially,  the 

function  itself  could  also  vary  with  t.  In  what  is  to  follow,  we  use  Berry  and  Ho’s  (1988) 

specification: 

0,  if  Af  =  0 

y(6,A,)  =  ■{  -I,  if  A,  =  1  and  6<0  (2.7) 

I  KS.,  if  Af  =  1  and  6  >  0. 

where  L  and  K  are  given  positive  constants.  Berry  and  Ho  (1988.  p.  222)  provide  a 
justification  for  this  gain  function  (they  actually  specify  a  loss  function)  in  the  context  of 
pharmaceutical  trials. 

Sampling  is  costly;  it  is  assumed  that  each  observation  in  the  sample  costs  one  unit 
so  that  at  time  t  the  cost  of  the  next  treatment  sample  and  control  sample  is  2nf.  .A.t 
each  time  point  f,  we  zire  faced  with  a  decision  whether  to  stop  the  trial  or  not.  A  well- 
informed  decision  will  weight  up  the  gains  of  stopping  versus  the  net  gains  to  be  expected 
from  continuing  to  sample  (after  debiting  sampling  costs).  In  the  event  that  the  trial  is 
continued  to  the  time  point  (t  -|- 1),  how  many  observations  should  be  sampled?  We  shall 
show  that  there  is  an  optimal  number  n*,  of  treatment  and  control  samples,  that  depends 
only  on  the  posterior  distribution  of  S  md  the  exogeneous  constants  K  and  L  of  the  gain 
structure. 

In  general,  a  sequential  procedure  can  be  completely  specified  through  a  stopping  rule 
5  =  {5t  ;  t  =  0.  l.  -.T},  a  terminal-decision  rule  A  =  {A/  :  t  =  0.1.--.r}  and  a 
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sample-size  rule  n  =  {nt  :  i  =  0, 1,  •  •  ■ ,  T),  all  of  which  are  functions  of  the  data.  (For  the 
problem  we  consider  here,  5t,  A/,  and  nt  will  be  functions  of  wt  given  by  (2.4).)  When 
St  =  1.  the  procedure  stops  at  time  t\  otherwise  St  =  0.  in  which  case  sampling  continues. 
Suppose  the  procedure  stops  at  time  t:  When  At  =  0,  the  hypothesis  Ho  is  chosen; 
otherwise  At  =  1,  in  which  case  Hi  is  chosen.  The  function  Ut  is  nonnegative-integer 
\'alued;  should  sampling  continue  at  time  t,  it  specifies  the  sample  size  to  be  taJcen  between 
time  t  tind  time  (t  -(- 1).  In  much  of  the  sequential-analysis  literature,  the  sample-size  rule 
n  is  not  mentioned,  since  it  is  usually  assumed  that  one  sample  (or  a  specified  constant 
number  of  samples)  is  taken  at  each  time  point. 

Now,  since  a  decision  must  be  made  by  (truncation)  time  T,  =  1  and  nr  =  0.  Berry 
and  Ho  (1988)  make  further  restrictions  on  their  sequential  procedure.  They  argue  that 
one  should  stop  investigating  an  ineffective  treatment,  but  obtain  as  much  information 
as  possible  on  a  good  one.  That  is,  their  procedure  is  one-sided  and  the  combination 
=  1.  A<  =  1  never  happens  for  t  =  0,  !,•••, T  —  1.  (In  Section  5,  we  show  that  the 
one-sided  restriction  can  be  relaxed.)  Also,  they  assume  rif  =  fc,  a  prespecified  integer,  for 
t  =  0,l,-",r  —  1.  For  most  of  what  is  to  follow,  we  retain  all  aspects  of  Berry  md  Ho’s 
procedure  except  for  the  restriction  on  the  sample-size  rule.  We  shall  demonstrate  that 
there  can  be  considerable  benefits  to  be  had  from  obtaining  optimal  sample  sizes. 

3.  Bayes  sequential  procedures. 

To  simplify  notation,  we  shall  call  the  Bayes  sequential  procedure  proposed  by  Berry 
and  Ho  (1988)  the  B-procedure.  Upon  relaxing  the  sample-size  restriction  in  the  B- 
procedure,  the  ENG-maixirazil  (Bayes)  sequential  procedure  can  be  obtained,  which  we  call 
the  B*-procedvu:e.  By  definition,  since  the  B*-procedure  is  optimal  over  a  less  restrictive 
set  of  procedures  than  the  B-procedure,  the  former’s  ENG  is  larger  them  or  equad  to  the 
latter’s.  We  shall  demonstrate  in  Section  4  that  the  increase  in  the  B*-procedure’s  ENG 
can  sometimes  be  substantial. 

First,  we  shedl  describe  the  B-procedure,  generahzed  slightly  here  to  handle  any  pre¬ 
specified  sample-size  rule  n.  Berry  and  Ho  (1988)  show  it  to  be  Bayes  using  backward 
induction  (e.g.,  DeGroot.  1970,  Section  12.4).  Assuming  the  ENG  of  the  sequentiad  proce¬ 
dure  at  time  (t-|- 1)  is  known,  an  optimal  procedure  at  time  t  can  be  obtained  by  comparing 
the  ENGs  of  stopping  and  choosing  various  hypotheses  with  the  ENG  of  continuing  to  sam¬ 
ple  (which  involves  an  average  over  the  known  ENG  at  time  i  -I- 1).  The  optimal  stopping, 
terminal- decision,  and  sample-size  rules  at  time  t  are  obtained  by  taking  whatever  action 
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achieves  the  mtixiinum  of  till  the  competing  ENGs. 

From  (2.5),  the  prior  probability  that  Hq  is  true  is 

Pq  =  j To)  ,  (3.1) 

where 

$(i)=  /  (27r)-'/2exp(-sV2)<i5  .  (3.2) 

J  — oo 

Let  vt  and  rf  be  the  posterior  mean  and  variance  respectively  of  8  at  time  t.  Then,  it  is 
straightforward  to  show  that 

t-i  t-i 

Vt  =  [voa^  +  ”t)To  }  (3-3) 

i=0  i=0 

+(5Z«.Ko  }  •  (3-4) 

t=0 

Thus,  the  posterior  probability  that  Hq  is  true  is 

Pi  =  ^(st) ,  (3.5) 

where 

St  =  ~vt/Tt;  <  =  0,  l,---,r.  (3.6) 

Since  $  given  by  (3.2)  is  a  one-to-one  mapping,  there  is  no  loss  of  information  from 
considering  the  pair  of  posterior  parameters  pt,  Tt  (or  st,  Tt),  rather  than  vt,  Tf. 

One  further  result  that  is  needed,  to  obtain  the  Bayes  procedure,  is  the  preditive  (or 
marginal)  distribution  of  Wt+i  given  pt,Tt.  Upon  integrating  out  the  conditional  density 
of  8  given  pt,r/,  we  obtain  the  predictive  density  qt+i  of  wt+\',  that  is. 

wt+i\pt,rt Gau{-^~^(pt)Tt,Tt  +{(r'^/nt))  ■,  #  =  0,  •  •  • .  J  -  1  .  (3.7) 

Following  Berry  and  Ho  (1988),  the  ENG  for  stopping  at  time  t  and  choosing  Hi  is 
(from  (2.7)): 

Hpt^Tt)  =  -Lpt  -I-  E{K8\8  >  Q,pt,Tt)(l  -  Pt) 

=  -Lpt  +  KTt{<l>(^~Hpt))  +  ^~^(Pt){l  -  Pt)]  t  =  0.  l.---.r.  (3.8) 

where 

d{x)  =  ^'(x)  =  {2n-)~‘/2  e.xp(-.rV2)  .  (3.9) 
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Notice  that  the  data  and  prior  appear  in  (3.8)  via  the  posterior  parameters  given  by  (3.3) 
through  (3.6).  Similarly,  from  (2.7),  the  ENG  of  stopping  at  time  t  and  choosing  Hq  is: 


2{pt,Tt)  =  0\  t  =  0, 


(3.10) 


Let  denote  the  expected  net  gain  (ENG),  at  time  t,  of  the  Bayes  procedvire  whose 
truncation  time  is  T.  At  t  —  T,  Hq  ov  H\  must  be  chosen.  The  optimal  terminal-decision 
rule  is,  from  (3.8)  and  (3.10), 


(  0,  if  h(pT,  tt)  <  z(pT,  tt) 

\  1,  otherwise. 


(3.11) 


vnth  ENG 


VriPT^TT)  =  max{z(pr,Tr),h(pr,7‘r)}  • 


(3.12) 


At  <  =  T  —  1,  the  options  are  either  stopping  or  continuing  the  tried  by  sampling 
according  to  the  sample-size  rule  nT~i  •  Conditional  on  the  posterior  parameters  pr-i ,  Tr-i 
(and  the  sample  size  rule),  the  ENG  of  continuing  is 

9T-i{PT-i,TT-i,nT-i)  =  i?FT-i{^/(PT,Tr)|pT-i,'rT-i,’^r-i}  -  2n7'-i  ,  (3.13) 


where  Eft-i  denotes  expectation  with  respect  to  the  predictive  distribution  of  wt-i  given 
by  (3.7).  Notice  that  the  sampling  cost  2nr-i  is  subtracted  from  the  expected  gain  of 
continuing  to  sample.  Since  is  given  by  (3.12),  the  quantity  (3.13)  can  be  computed 
and  compared  with  the  ENG  of  stopping  at  time  t  =  T  —  1. 

More  generally,  at  time  t,  the  ENG  of  continuing  the  trial  to  time  (t  -t- 1)  by  sampling 
according  to  nj  is 

gt(pt,rt,nt)  =  EF,{yt+i(Pt+uTt+i)\pt,Tt,nt]  -  2n(  ;  t  =  O.--  .T  -  1  ,  (3.14) 

where  Ep,  denotes  expectation  with  respect  to  the  predictive  distribution  of  tof+i  given 
by  (3.7).  Thus,  in  order  to  complete  the  backwzu-d  induction  for  the  B-procedure,  we  need 
formulas  for  {V’^_^(pr-^  tt-i)  :  f  =  1,2,  •  •  • ,  T}.  (The  formula  for  V'/  is  given  by  (3.12).) 

Recall  that,  for  the  purposes  of  comparison  to  Berry  and  Ho's  results,  we  have  a 
one-sided  procedure  for  t  =  0.  •  •  • .  T  —  1.  Thus,  from  (3.10)  and  (3.14),  we  obtain 

V,^{pt,T,)  =  max{z{pt,Tt),gt{pt,T,.n,)}  :  f-0.---.T-l.  (3.15) 
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so  that  depends  on  (the  expected  \-alue  of)  Since  is  given  by  (3.12).  the 

backward  induction  can  proceed  to  V^_-^  to  l'-]rL2,  forth,  to  .  In  terms  of 

{(5t,  Aj,  Tit)  :  t  =  0,  -  •  ■  .T  —  1},  the  J5-procedure  is  given  by 


f  1,  if  9t{pt,r,,nt)  <  z{pt,Tt} 

\  0,  otherwise  : 


(3.16) 


A,  =0;  (3.17) 

and  rit  is  prespecified  (f  =  0,  •  •  •  ,T  -  1).  Recall  that,  for  t  =  T,  St  =  1  and  Ar  is  given 
by  (3.11). 

Having  given  an  optimal  stopping  rule  and  an  optimal  terminal-decision  rule,  let  us 
now  turn  our  attention  to  the  ^‘-procedure,  which  in  addition  incorporates  an  optimal 
sample-size  rule.  Define  the  optimal  sample  size  at  time  t  by  the  value  of  Ut  that  maximizes 
(3.14);  specifically, 


nt(pt,rt)  =  argmax„,>o{.9t(P«iT«,nt)};  <  =  1  , 


(3.18) 


where  argmax„>(){*  •  •}  denotes  a  value  of  the  argument  n  that  achieves  the  maximum 
for  the  expression  in  braces.  If  there  are  several  such  values,  choose  the  smallest.  Upon 
substitution  of  {n?  :  <  =  0,  •  •  • ,  T  -  1}  in  place  of  (n*  :  <  =  0,  •  •  • ,  T  -  1}  into  {5t  :  <  = 
0, •  •  •  ,T}  and  {At  :  t  =  0, •  •  •  ,T},  we  obtain  the  ^‘-procedure.  Likewise,  from  (3.15),  its 
ENG  is 

yt^*(pt,r,)  ~  max{z(pi,Tt),gt(pt,rt,nt)}  ;  #  =  0,  •  •  • .  T  -  1  .  (3.19) 

By  definition,  .  In  the  examples  given  in  the  next  section,  we  show  that  the 

B*-procedure  can  yield  a  substantially  larger  ENG.  In  order  to  compute  the  B-procedure 
aj^d  ^'-procedure  from  given  gain  constants  K  and  L,  prior  peurameters  uo  and  Tq  ,  and 
model  parameter  cr^,  %'anous  2inalytictil  results  about  h  and  gt  are  needed.  These  can  be 
found  in  Berry  and  Ho  ( 1988)  and  gueirantee  the  existence  of  breaJc-even  \’alues  69 ,  •  •  ■ .  6t-i 
such  that  (3.16)  can  be  written  as 


^  _  r  1.  if  pt  <bt 
'  \  0,  otherwise;  t  =  0.---,r  — 1. 

Further,  there  exists  a  br  such  that  (3.11)  can  be  written  as 


if  PT  <  bT 
otherwise  . 


(3.20) 


(3.21) 
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Finally,  since  gt  given  by  (3.13)  is  bounded  above  as  a  function  of  n<,  (3. IS)  is  well  defined 
and  hence  the  B*-procedure  can  also  be  given  in  terms  of  brezik-even  values  6^,  ■  •  • .  b^. 

4.  Examples. 

This  section  presents  numerical  results  for  two  different  choices  of  K.L,Tq,  and  cr~. 
(Notice  that  specification  of  the  prior  mean  uq  of  6  is  not  needed  for  performance  evalu¬ 
ations,  since  the  optimal  sequenticJ  procedures  cam  be  given  in  terms  of  {p<}  rather  than 
{lut}.  To  implement  the  procedures  on  data,  uq  has  to  be  specified.)  Using  backward 
induction  to  do  the  computations,  means  that  the  time  taken  to  implement  the  procedxire 
is  linear  in  T. 

We  deliberately  chose  one  example  to  be  that  considered  by  Berry  and  Ho  (1988),  in 
order  to  validate  our  results.  In  this  example,  their  choice  of  n<  =  30  (t  =  0, 1,2  —  T) 
txirns  out  to  be  close  to  optimal.  The  second  example  was  chosen  to  show  that  such  an 
ad  hoc  choice  of  sample  size  can  be  far  from  optimal  and  that  the  B* -procedure  cam  offer 
considerable  improvement. 

4.1.  Example  1. 

Consider  the  constants  specified  by  Berry  and  Ho  (1988),  namely,  K  =  5000,  L  =  2000, 
Tg  =  1,  and  <7^  =  2.  It  is  instructive  to  consider  initially  the  case  T  =  1.  Figure  1  shows  a 
contour  plot  of  the  ENG  goipo,  1,  no),  given  by  (3.14),  as  a  function  of  po  on  the  horizontal 
axis  and  no  on  the  vertical  axis.  The  optimal  nj,  as  a  function  of  po,  will  follow  a  ridge, 
but  it  is  clear  that  the  contour  lines  axe  essentially  parallel  and  that  Berry  and  Ho’s  choice 
of  no  =  30  is  as  optimal  as  any  sample  size  in  the  range  [20, 50]. 

Figure  1  here 

.•\nother  way  to  make  the  comparison  is  to  superimpose  go  as  a.  function  of  po,  for  severed 
chosen  values  of  no.  Figure  2  shows  go(*,  l,no);  no  =  0, 1, 30,  and  100,  on  the  same  graph. 
For  most  of  po,  no  =  30  has  the  largest  ENG. 

Figure  2  here 

Figure  3  shows  the  optimal  sample  size  nj  as  a  function  of  po  for  the  ceises  T  =  1,2. 
and  3.  .A.s  T  increeises,  initial  optimal  sample  sizes  tend  to  decrease,  which  is  sensible  since 
there  are  more  times  at  which  further  samples  could  be  taken.  The  break-even  \ulues  are 
b*  =  0.979.0.984,  and  0.984,  for  T  =  1,2,  amd  3,  respectively. 

Figure  3  here 
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The  ENG  goiPo,  T  ^o(Po))  is  the  largest  possible  ENG  associated  with  the  decision  to 
continue  sampling;  selected  values  for  T  =  1.2,  and  3  are  given  in  Table  1.  For  fixed  po. 
these  \’alues  are  increasing  in  T.  Interestingly,  the  table  shows  that  there  is  little  to  gain 
by  specifying  a  truncation  time  beyond  T  =  2. 

Table  1  here 

4.2.  Example  2. 

Consider  the  constants  K  =  100,  L  =  5000,  Tq  =  1,  end  cr^  =  0.5.  these  \’alues  were 
chosen  from  among  several  combinations  because  they  illustrate  clearly  the  advantage  of 
the  ^‘-procedure  over  the  B-procedure.  To  compare  with  Example  1,  consider  the  case 
T  =  1.  Figure  4  shows  a  contour  plot  of  the  ENG  po(po,  1, no)  as  a  function  of  po  and  no 
(cf.  Figure  1).  The  ridge  that  traces  out  the  optimal  value  nj(po)  as  a  function  of  po  is 
rather  pronounced  and  gives  values  of  nj  around  5.  An  ad  hoc  choice  of  no  =  30  is  far 
from  optimal.  To  illustrate  this  point.  Figure  5  shows  the  percentage  increase  of  the  ENG 
9o{poi  T  ”o(Po))  compared  to  the  ENG  po(po>  30).  Substantial  improvements  in  expected 
net  gain  are  possible  using  an  optimal  choice  of  sample  size. 

Figure  5  here 

Figure  6  shows  the  optimal  sample  size  nj  as  a  function  of  po  for  the  C2ises  T  =  1,2, 
and  3.  The  break-even  values  are  6^  =  0.737,0.754,0.754,  for  T  =  1.2.  and  3,  respectively. 

Figure  6  here 


5.  Discussion. 

The  Bayesiein  sequential  approach  relies  heavily  on  the  specification  of  a  gain  structure. 
In  medical  trieds  where  human  lives  are  at  stake,  this  will  be  a  difficult  (if  not  impossible) 
task.  However,  the  classical  sequential  trial  (e.g.,  WTiitehead,  1983)  is  once  removed  from 
this  type  of  specification. 

In  this  article,  we  have  demonstrated  that  the  Bayesian  approach  to  sequential  sam¬ 
pling  can  be  extended  to  include  an  optimal  sample-size  rule,  which  is  a  function  of  the 
posterior  distribution.  The  rule  can  be  calculated  before  any  sampling  takes  place.  It  can 
either  be  adhered  to  strictly  throughout  the  sequential  trial  or  can  be  used  to  obtain  an 
approximately  optimal  sample  size  for  a  group-sequential  procedure. 

For  example,  consider  the  case  T  =  1  in  Section  4.1;  uq  =  20  yields  an  ENG  that  is 
close  to  goipot  1.  ^oipo))  over  much  of  0  <  po  <  1.  Vsdues  of  ng  for  larger  T  tend  to  be  less 
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than  for  r  =  1  and.  for  fixed  T  >  1.  this  is  also  true  for  n* ,  ■  •  • .  .  Thus,  a  conser\’ative 

specification  of  the  total  sample  size  needed  for  the  B*-procedure  is  20T.  Alternatively, 
a  group- sequential  S-procedure  with  no  =  •  •  •  =  tit-\  =  20  would  achieve  ENGs  close  to 
those  of  the  ^'-procedure.  (Berry  and  Ho's  choice  of  no  =  •  ■  •  =  nr-i  =  30  would  yield  a 
B-procedure  with  similar  ENG  properties,  however  it  would  need  more  samples.) 

Similzir  reasoning  applied  to  Section  4.2  yields  a  conservative  specification  of  the  B*- 
procedure’s  total  sample  size  to  be  oT.  Or,  a  group-sequential  B-procedure  with  no  = 
•  •  •  =  nT-\  =  5  would  achieve  ENGs  close  to  those  of  the  B'-procedure.  However,  a 
B-procedure  with  no  =  •  •  •  =  nr-i  =  30  would  not  be  appropriate  at  all. 

Although  Berry  and  Ho  (1988)  chose  to  carry  out  a  one-sided  sequential  pharmaceu¬ 
tical  trial,  it  is  easy  to  adapt  their  approach  to  handle  the  two-sided  case.  In  that  case, 
(3.16)  becomes 

„  _  f  1.  if  gt{Pt,Tt,nt)  <  max{z{pt,Tt),h{pt,Tt)}  _ 

‘  1  0,  otherwise;  <  =  0,  1, 


(3.17)  becomes 


(3.15)  becomes 


if  h{pt,Tt)  <  z{pt,Tt) 
otherwise;  #  =  0,  •  •  • ,  T  —  1. 


(6.2) 


V^iPt^Tt)  =  max{z(p(,rt),h(pt,rt),fift(pt,rt,n«)}  ;  t  =  0,  •  •  • ,  T  -  1  ,  (6.3) 

and  St,  At,  and  remain  unchanged.  To  optimize  on  sample  size,  again  use  (3.18)  to 
choose  n*  and  substitute  {n*}  in  place  of  {n*}. 

Although  the  probability  of  type  1  error(a),  power(7r),  and  expected  total  sample 
size  axe  not  considered  by  Bayes  sequentieJ  procedures,  there  is  evidence  that  a  superior 
performance  can  also  be  expected  in  terms  of  these  criteria.  In  the  case  of  a  simple  null 
hypothesis  versus  a  simple  alternative  hypothesis  for  testing  Gaussian  means,  Cressie. 
Biele.  and  Morgan  (1991)  demonstrate  the  B*-procedure's  small  a  and  large  tt  over  much 
of  0  <  po  <  1.  Moreover,  Cressie  and  Morgan  (1992)  show  that  of  all  sequential  procedures 
with  a  given  a  and  tt.  the  B*-procedure  minimizes  the  expected  total  sample  size. 
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FIGURE  CAPTIONS 


Figure  1:  Contour  plot  of  ENG  gaipo.,  l.rio  j  as  a  function  of  po  and  uq.  On  the 

horizontal  axis,  0.1  <  po  <  0.6,  and  on  the  vertical  axis,  no  ranges  from 
5  to  50  in  steps  of  size  5.  Constants  axe  given  in  Section  4.1:  T  =  1. 

Figure  2:  Plot  of  ENG  ^o(po,  li^^o)  as  a  function  of  po,  for  no  =  0, 1, 30,  and  100. 
On  the  horizontal  axis,  0.10  <  po  <  1.00.  Constants  axe  given  in 
Section  4.1;  T  =  1. 

Figure  3;  Optimal  sample  sizes  nj  and  aissociated  continue  szunpling  intervals 
[0,  i*)  for  T  =  1,2,  and  3.  Constants  axe  given  in  Section  4.1. 

Figure  4:  Same  as  for  Figure  1  except  no  ranges  from  1  to  21  in  steps  of  size  2 
and  constants  axe  given  in  Section  4.2. 

Figure  5:  Percentage  increase  of  ENG  5o{po,  1»  n*{po))  over  ENG  po(po^  1, 30). 
Constants  axe  given  in  Section  4.2;  T  =  1. 

Figure  6:  Same  cis  for  Figure  3  except  constants  are  given  in  Section  4.2. 
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